Method for generating information sequence segments using the quality functional of processing models
Annotation
The constantly emerging need to increase the efficiency of solving classification problems and predicting the behavior of objects under observation necessitates improving data processing methods. This article proposes a method for improving the quality indicators of machine learning models in regression and forecasting problems. The proposed processing of information sequences involves the use of input data segmentation. As a result of data division, segments with different properties of observation objects are formed. The novelty of the method lies in dividing the sequence into segments using the quality functional of processing models on data subsamples. This allows you to apply the best quality models on various data segments. The segments obtained in this way are separate subsamples to which the best quality models and machine learning algorithms are assigned. To assess the quality of the proposed solution, an experiment was performed using model data and multiple regression. The obtained values of the quality indicator RMSE for various algorithms on an experimental sample and with a different number of segments demonstrated an increase in the quality indicators of individual algorithms with an increase in the number of segments. The proposed method can improve RMSE performance by an average of 7 % by segmenting and assigning models that have the best performance in individual segments. The results obtained can be additionally used in the development of models and data processing methods. The proposed solution is aimed at further improving and expanding ensemble methods. The formation of multi-level model structures that process, analyze incoming information flows and assign the most suitable model for solving the current problem makes it possible to reduce the complexity and resource intensity of classical ensemble methods. The impact of the overfitting problem is reduced, the dependence of processing results on the basic models is reduced, the efficiency of setting up basic algorithms in the event of transformation of data properties is increased, and the interpretability of the results is improved.
Keywords
Постоянный URL
Articles in current issue
- Optical properties of the interface between indium tin oxides thin films and laser-deposited single-walled carbon nanotubes
- The xanthene fluorescent dyes usage for the microplastics in soil detection and for phytotests
- Investigation of the effect of the applied voltage to the control electrodes of a lithium niobate phase modulator on the intensity distribution at the ends of channel waveguides and on parasitic amplitude modulation
- Assessment of the quantitative composition of hydrate formation inhibitors by their infrared spectra
- Magneto optical properties of atmospheric air molecules
- Femtosecond laser modification of the ZnO:Ag sol-gel films within dichroism emergence
- Insights from Keldysh theory to plasma electron density in liquid water under excitation wavelength scaling
- Luminescent and colorimetric properties of silica-coated spherical cadmium telluride nanocrystals in an external electric field
- The sliding-mode observer for PMSM field-oriented sensorless control with adaptive filter and PLL
- Improving the algorithm for processing data from multisensor system in tasks of determining quality parameters in vegetable oils
- Lithium tetraborate co-doping with transition and alkali metals
- Analysis of chemical interactions during filling a cesium vapor cell for a quantum magnetometer
- Polymer-salt synthesis and study on structure of vanadium-doped yttrium-aluminum garnet
- Enhancing healthcare data security in cloud environments with dual authentication and optimal key-tuned encryption
- Elimination of distortions of weak images of astronomical objects on the example of Saturn, Jupiter and their satellites
- Smartphone video motion deblur order model
- An approach to detecting L0-optimized attacks on image processing neural networks via means of mathematical statistics
- On the influence of a concentrated inclusion on the spectrum of natural vibrations of a string and Bernoulli-Euler beam
- Restoration of unsteady heat flow from a thermal energy accumulator by solving the inverse heat conduction problem
- Management of space surveillance radar temporal resource on fuzzy set theory
- Quantification and modeling of ankle biomechanical characteristics